Segmenting Hashtags using Automatically Created Training Data

نویسندگان

  • Arda Çelebi
  • Arzucan Özgür
چکیده

1. Hashtags increasingly used to convey the actual message in tweets. Phrases and sentences turned into a hashtag. 2. Word with sentiment may trap inside a multi-word hashtag 3. Noisy and compact nature of language leads to hashtags very difficult to segment; sometimes depends on context. eg. #together; “to get her” or “together”? 4. Can we use carefully auto-segmented hashtags for training? RELATED WORK

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Segmenting Twitter Hashtags

Social Media Posts On Platforms Such As Twitter Or Instagram Use Hashtags, Which Are Author-Created Labels Representing Topics Or Themes, Toassist In Categorization Of Posts And Searches For Posts Of Interest. The Structural Analysis Of Hashtags Is Necessary As Precursor To Understandingtheir Meanings. This Paper Describes Our Work On Segmenting Nondelimited Strings Of Hashtag-Type English Text...

متن کامل

Bootstrapped Learning of Emotion Hashtags #hashtags4you

We present a bootstrapping algorithm to automatically learn hashtags that convey emotion. Using the bootstrapping framework, we learn lists of emotion hashtags from unlabeled tweets. Our approach starts with a small number of seed hashtags for each emotion, which we use to automatically label tweets as initial training data. We then train emotion classifiers and use them to identify and score c...

متن کامل

Segmenting Hashtags and Analyzing Their Grammatical Structure

Originated as a label to mark specific tweets, hashtags are increasingly used to convey messages that people like to see in the trending hashtags list. Complex noun phrases and even sentences can be turned into a hashtag. Breaking hashtags into their words is a challenging task due to the irregular and compact nature of the language used in Twitter. In this study, we investigate feature-based m...

متن کامل

Towards Deep Semantic Analysis of Hashtags

Hashtags are semantico-syntactic constructs used across various social networking and microblogging platforms to enable users to start a topic specific discussion or classify a post into a desired category. Segmenting and linking the entities present within the hashtags could therefore help in better understanding and extraction of information shared across the social media. However, due to lac...

متن کامل

Harnessing Twitter "Big Data" for Automatic Emotion Identification

User generated content on Twitter (produced at an enormous rate of 340 million tweets per day) provides a rich source for gleaning people’s emotions, which is necessary for deeper understanding of people’s behaviors and actions. Extant studies on emotion identification lack comprehensive coverage of “emotional situations” because they use relatively small training datasets. To overcome this bot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016